
Reimplement Atlas WPWM and Z 13 TeV TOT #2207

Open
wants to merge 8 commits into master from reimplement_ATLAS_WPWM_13TEV_TOT
Conversation

@comane (Member) commented Nov 9, 2024

Implementation agrees with legacy (t0-covmat included):

Z TOT Benchmarks

(this branch) https://vp.nnpdf.science/edlFDxSrRoK3stoHz-HtdQ==
(master) https://vp.nnpdf.science/JhZHlSMRQMigrmyn1OOcLg==

WPWM TOT Benchmarks

(this branch) https://vp.nnpdf.science/HM0ylMDJSvGocbHKjowGIQ==
(master) https://vp.nnpdf.science/MX4e1EQaQrafwPSwXihhTw==

@@ -0,0 +1,25 @@
bins:
- k1:
Review comment (Member):

You can remove the k1.

"""
kin = []

mw2 = 80.385**2
Review comment (Member):

Please, put this as a module (or even at the filter_utils level) variable, MW2 = ... so that it can be modified for many datasets at once.
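For instance, something along these lines (module location hypothetical; the value is the one already used in this filter):

# e.g. in filter_utils (hypothetical location)
MW2 = 80.385**2  # W-boson mass squared in GeV^2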

@scarlehoff (Member) commented, quoting "finally we add the lumi unc as MULT SPECIAL":

Sorry, why "special"? What's wrong with ATLASLUMI13? That will correlate it with the other 13 TeV datasets.

@enocera (Contributor) commented Nov 27, 2024, quoting @comane:

Current implementation disagrees with legacy version.

@scarlehoff and @enocera I tried to reimplement this dataset and got a different covariance matrix. I tried to have a look at what the old implementation in buildmaster was doing but failed to really understand it. What I particularly failed to understand was the way in which the covariance matrix was constructed; see lines 93-98 of ATLAS_WZ_TOT_13TEV.cc in the old buildmaster. Below I describe how I implemented the covariance matrix, following what I understood from the paper and extra sources.

In this new implementation the covariance matrix is constructed in the following way:

* get correlation matrix for systematics from https://atlas.web.cern.ch/Atlas/GROUPS/PHYSICS/PAPERS/STDM-2015-03/tabaux_03.pdf  -> corr matrix for systematics but not for lumi

* given the systematics in Table 3 of the paper https://arxiv.org/abs/1603.09222, we build the stat and syst covmats as


1. stat_cov = np.diag([stat_wm**2, stat_wp**2])

2. syst_cov = corr_matrix * np.outer(np.array([syst_wm, syst_wp]), np.array([syst_wm, syst_wp]))


* given the stat + sys cov we decompose it and write the artificial sys as CORR MULT

* finally we add the lumi unc as MULT SPECIAL

Dear @comane, as far as I understand, with this procedure you do take into account the correlation between the W+ and W- cross sections, but you do not include the correlation with the Z cross section. My suggestion (and I think this is what was done in the old buildmaster implementation) is to extend your procedure to Z like

syst_cov = corr_matrix * np.outer(np.array([syst_wm, syst_wp, syst_z]), np.array([syst_wm, syst_wp, syst_z]))

This will construct the total systematic covariance matrix for W+, W- and Z. Now, because CC and NC DY must go in different data sets (the reason being the way in which we correlate MHOUs), you need to generate artificial systematics from the total systematic covariance matrix and correlate these across the two data sets (one for W+- and one for Z). The decomposition into artificial systematics will give you three arrays with three systematic uncertainties each. If the order is W+, W- and Z, you put the first two arrays into the W+ and W- data set, and the third array into the Z data set. These uncertainties should have the same name (you choose which one) across the board. Then you attach the stat uncertainty to each data point (no need to construct a covmat) and the lumi uncertainty.
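A minimal numpy sketch of this extended procedure; the numbers below are hypothetical placeholders for Table 3 of arXiv:1603.09222 and for the correlation matrix of the STDM-2015-03 auxiliary table:

import numpy as np

# Hypothetical placeholder uncertainties for (W-, W+, Z); take the real
# values from Table 3 of arXiv:1603.09222.
syst = np.array([0.38, 0.46, 0.04])
# Hypothetical placeholder correlation matrix; take the real one from
# the STDM-2015-03 auxiliary table.
corr_matrix = np.array([[1.00, 0.93, 0.18],
                        [0.93, 1.00, 0.19],
                        [0.18, 0.19, 1.00]])

# Total 3x3 systematic covariance matrix for W-, W+ and Z.
syst_cov = corr_matrix * np.outer(syst, syst)

# Decompose into additive artificial systematics: art_sys @ art_sys.T
# reproduces syst_cov (eigenvalues clipped against numerical negatives).
eigvals, eigvecs = np.linalg.eigh(syst_cov)
art_sys = eigvecs * np.sqrt(eigvals.clip(min=0.0))

# The first two rows go into the W+- data set, the third into the Z data
# set, with identical column names so the correlation survives across sets.
art_sys_wpwm, art_sys_z = art_sys[:2], art_sys[2:]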

@comane force-pushed the reimplement_ATLAS_WPWM_13TEV_TOT branch from 956a26f to 5ecf049 on December 8, 2024
@comane changed the title from "[WIP] Reimplement Atlas WPWM 13 TeV TOT" to "Reimplement Atlas WPWM and Z 13 TeV TOT" on Dec 8, 2024
@comane mentioned this pull request on Dec 8, 2024
@comane requested a review from @achiefa on December 8, 2024
@comane (Member, Author) commented Dec 8, 2024
@enocera thank you for the detailed explanation.

@comane requested a review from @enocera on December 8, 2024
@achiefa (Contributor) commented Dec 9, 2024

Hi @comane, is this PR ready for review?

@comane (Member, Author) commented Dec 9, 2024, replying to @achiefa:

Yes it is. As written in the PR description, the new implementation agrees with the old one apart from the t0 covariance matrices.
The reason for the disagreement is that in the old implementation the uncertainties were taken as MULT; however, looking at the paper (see Table 3), it seems to me that they are given in additive form.

@achiefa (Contributor) commented Dec 9, 2024, quoting @comane:

old implementation uncertainties were taken as MULT, however, by looking at the paper (see Table 3) it seems to me that they are given in additive form.

I see. However, I remember a similar situation where the uncertainties were nonetheless multiplicative. In other words, expressing the uncertainties in absolute value is not a sufficient condition for them to be additive. I might be wrong though, so I summon @enocera.

@comane (Member, Author) commented Dec 9, 2024

Yes, this might be the case. Perhaps @enocera remembers this detail!

@achiefa (Contributor) left a review:

Ok, I left a few comments and some questions that we can discuss.


yaml.add_representer(float, prettify_float)

MZ2 = 91.1876**2
Review comment (Contributor):
I'm starting to wonder whether we should use a common source for these parameters. I usually take the min and max values of the mass as indicated in the paper, and then take the mean value. Honestly, I don't know which is better. It's not a big issue though, as this is only used for the (x, Q2) map. What do you think?

Reply (Member, Author):

In this case I literally just took the value that was being used in the legacy version.

But this is a good point. I agree with you that it would be nice to collect these values/constants in one place (e.g. filter_utils) so as to use them consistently.

Reply (Member, Author):

Sorry, I thought I had already replied to this.
I think it's a good idea in principle.
Here I really just took the value that was already used in the legacy version for the kinematics.

@comane force-pushed the reimplement_ATLAS_WPWM_13TEV_TOT branch from bec580f to bebefea on December 9, 2024
@enocera (Contributor) commented Dec 11, 2024

@achiefa @comane

General considerations. The way in which an experimental uncertainty is presented (absolute or percentage) is not an indication of its additive or multiplicative nature: additive uncertainties can be presented as absolute or percentage values, and likewise multiplicative uncertainties can be presented as absolute or percentage values. If you are undecided whether an uncertainty is additive or multiplicative, the NNPDF convention is to set it to multiplicative, the reason being that, because of the D'Agostini bias, it is worse to treat as additive an uncertainty which is actually multiplicative than the other way round. If you have artificial systematic uncertainties, determined from the decomposition of a covariance matrix, these must instead be additive: otherwise, because of the t0 prescription, the original covariance matrix cannot be reproduced.
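A small numerical illustration of the last point, under the assumption that MULT uncertainties are rescaled by the t0-prediction/data ratio (all numbers hypothetical):

import numpy as np

# Hypothetical 2x2 covariance matrix to be decomposed.
C = np.array([[0.20, 0.05],
              [0.05, 0.30]])
w, V = np.linalg.eigh(C)
A = V * np.sqrt(w)                       # additive artificial systematics
print(np.allclose(A @ A.T, C))           # True: ADD reproduces C exactly

# If the same columns were declared MULT, under t0 each row would be
# rescaled by the t0-prediction/data ratio before rebuilding the covmat,
# and the original C would no longer be reproduced.
r = np.array([0.97, 1.05])               # hypothetical t0/data ratios
A_mult = A * r[:, None]
print(np.allclose(A_mult @ A_mult.T, C)) # False in general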

Specific considerations. As far as I understand, in the data set under discussion you have three uncertainties: the statistical uncertainty (stat), a correlated systematic uncertainty (sys_1) and the luminosity uncertainty (sys_2). The correlated systematic uncertainty must be combined with the correlation matrix to generate a covariance matrix, which is decomposed into additive artificial systematic uncertainties, as I explained above; the luminosity uncertainty should be implemented separately, treated as multiplicative and 100% correlated. I understand that this is what you did.

@comane (Member, Author) commented Dec 11, 2024

Thanks @enocera and @achiefa,
I am now treating the uncertainties as ADD (stat), MULT (lumi), ADD (sys).

The dataset now agrees fully (t0 included) with the previous implementation.
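For reference, a hedged sketch of what the uncertainty definitions could then look like in the new commondata format (field names illustrative; the lumi type follows @scarlehoff's ATLASLUMI13 suggestion):

definitions:
  stat:
    description: statistical uncertainty
    treatment: ADD
    type: UNCORR
  sys_corr_1:
    description: first artificial systematic from the W/Z covmat decomposition
    treatment: ADD
    type: CORR
  lumi:
    description: luminosity uncertainty
    treatment: MULT
    type: ATLASLUMI13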
